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1 AGCTTTATAA CCATGTGATC CCATCTTATG GTTTCAATCC ATGCACAGGA 

51 GGAAAATTGT GGGCACGAAG TTTCCAAAGG GAAAATTTAT AGATTGGTAG 

101 TTAATGAAAT ACAGTTTTCC TCCTTGGCAA ATTTAATTTA CTAg/tTCAC 

151 TGTATAGGAA AAAGCAGGAA AAAAATTAAA ACCAACTCAC CTCCAAACCT 

01 GTTTTGAGCT TTTACTTGTC TGCCCAATTG ATAGTTTCTA CTCTCTGCTT 



251 TTGATGAAAA TATTTTTTAT TATTTTAATG TAACTTCTGA AAACTAAATT 
301 ATCTAGAAGC AAATAAAAAG ATATTGCTTT TAT^TTCCC AGAAGGAAAA 
351 AACAAACACT AGGAAAGTTC TAT CTATCAG A/TGGGGGAGA TGTGATGGAG 
401 GCAGTGATAT TTGAGCTGAG CCTTGAACAA TGAACAGGAG TCTACCAAGC 
451 GAGAGGCTAG CGGGTGGCCC TCAAgXa ACAACAGCAT GTACAAAGGC 
501 AT GG AG ACAT ACACATCTTG ACTOTTCCAG GAATGGTGGG AACGCTGGTG 
551 GAGCTAGAAT GTAGGTACAT A^CATAAAGT GGCAGACGGG AAGCCTTTGG 
601 AAATCTTATT ACATAGGACq/cTGGATGCCA TTCCAATGAC TTTGAATTTT 
651 CTGTAGGCTG CCAGCgX TTCCAAGCGT GATAGAGTCA TGTCTATCTA 
701 TGCACTTCAG AAAGA^ACC TCAGGGTTAA TGAAGAAAAT GCATTGGAAT 
7 51 ATAAGAAACT GGT^CCAGA GTGATCAATT GCATGACTGT TGTGAAAGTC 
801 CAGGTGAGGG g/gCTGTGGG CAAGGTCAGA GTTGAGAGGC ATTTCAGAGA 



85i taaaatgacaZtaactaagt AGATGTCAGG CTGAGAAGAA AGGGCTGTAC 

901 CAGATATATG GTGCTATCAT TAAGTGAGCT CAACATTGCA GAAAAGGGGT 

951 aggttt/gtg GGAGTTGCTC ACAAAACATG TTTAGTCTAA GCAAAACCAT 

1001 TGCCATGGGC T CAGAT AAAA GTTAAGAAGT GGAAACCATT CCTACATTCC 

1051 ta'taggagct GCTATCTGGA AGGCCTAGTA TACACGTGGC TTTTCAGCTG 

1101 /tgattttgtt TGATTTTAGG GATTATTCTT TTTCTGAATC TGAGCAATGT 
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1151 TAGCGTGTAA AATACTCACA CCCACAGCTT TGACTGGGTG AGAAGTT AT C 
1201 ATAAATCATA TTGAGTTTGT TGTGATACCT TCAGCTTCAA CAAGTGATGA 

1251 GTCAGGTCAA CTCCATGTGA AAGTTCCTTG CTAAGCATGC AGATATTCTG 

/ 

1301 AAAGGTTTCC TGGTACACTG GCTCATGGCA CAGATAGGAG AAATTGAGGA 
1351 AGGTAAGTCT TTGACCCCAC CTGATAACAC CTAGTTTGAG TCAACCTGGT 
14 01 TAAGTACAAA TATGAGAAGG CTTCTCATTC AGGTCCATGC TTGCCTACTC 
14 51 CTCTGTCCAC TGCTTTCGTG AAGACAAGAT GAAGTTCACA GTGAGTAGAT 
1501 TTTTCCTTTT GAATTTACCA CCAAATGATT GGAGACTGTC AATATTCTGA 
1551 GATTTAGGAG GTTTGCTTCT TATGGCCCCA T CAT GG AAAG TTTGTTTTAA 
1601 AAAAATTCTC TCTTCAAACA CATGGACACA GAGAGGGGAA CAACACACAC 
1651 CAGGTCCTGT TGGGGGGTGG AGAGTGAGGG GAGGGAACTT AGAGGACAGG 
1701 TCAATAGGGG CAGCAAACCA CCATGGCACA CATATACCTA TGTAACAAAC 
17 51 CTGCACGTTC TGCACATGTA TCCCTTTTTT TTAGAAGAAG AAATAATGAA 

/ 

1801 AAAAAACCTT TTTTCTATTT ATATAATCAT GGCATTTATA AGCATCTCTA 
1851 TAGAGAAGGA TAATt/tGCT GAGATTAGAC AGCTGTCTGA GCACCTCACA 
1901 CTGACCTATT TTTAACAAAA TGACTTTCCA CATCACCTGA TTTCGGCTCC 
1951 ATGCRGGGTA AGCAGTTCCT AAGCCCTAGA AAGTGCCGAT CATCCCTCAT 
2001 TCTTGAATTO CTCCTTTTAT TTACCAAAAT TCCTGAGCAT GT T CAGG AAA 
2051 GATGAAAA'GC TTATTATCAA AATAAGTGGC TGAGATAGAC TTCTTGTCAC 
2101 ATTTGTTACA GTAAAATGGG TCTCCAAGAA AGAAAGATTT GCCTTGGGCT 
2151 CTAGCATGGC CATTTATTTA AG AAAGC AT C TGAAACATGA AGCTACCACA 
2201 GCATCTCTCC TGTGGTTCCA GACGGAAGCC TGAGAGTCTA GGAGGAGGTG 
2251 GACCGAGAAA CCCTGCCAAA GTAACTAGTA GTGCCGGGTT TCTCACAACA 
2301 CGATGCAAAG GGGCTAGAAT CAGATGACTA TTTTCATGTT T CAACAT ACT 
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2351 ACACACTGGA AAACGTTACG GCAGACTCTA CTTTATAATG GGGCTGCAAA 
2401 TGTAAAATGA CTACTAGAAC TAGGTCCTCT TAATAGCAGC AAAGTTTAAA 
2451 AGGGTCAGAG GGAGCTCCAG ACACAGGTTA GATTTGATTT CTCTCCTAGT 
2501 TCTGCTGTGA ACAAGAGGTA TAAGTTTGGC CAACTCACTT AACCCCTGAA 
2551 GCTCAGTTAC CTTATCTGTA AAATGATTGC ATTGTACTAG GTGTTCTCTA 
2601 AAATTTCTTC TACCTCTGAC TTTTTAGGAG ACTAATTTTT AACTCCTTTT 
2651 TAAGCTATTG GGAGAAAAAT TTAATTTTTT TTCAAAAGTT ACCTTGAATC 
2701 TCTAGAGCAG TTCTCAAAAC TATTTTGTCC CAGGCAAAGG AAATGAGACT 
2751 AGGTACCCAG AATGAGGCAC CCTGCATAAA GCTCTGTGCT CTGAAAACCA 
2801 ATGTCAGGGA CCCTGTGATA AATAATTAAA CCAAGTATCC TGGGACACTG 
2851 CTAGTGACAT CGCCTCTGCT GATCACTCTT GCCAGCGAGA CACTCTATAC 
2 901 TTGCTTTCTC ATCATTGGCA TCCAAACTGC CTACTAATCC ATTGCTTTGG 
2951 AAAGTTTTTT TTAATAAAAA /GATTATTTCT ATTAGGAGGA AAACATCCCA 
3001 TGTTAAATAG GAAAATTAAC TGAAATCATT TTCAGATGTG ATTTTTAGCA 
3051 CTTATAGCCA TTTCAAACCA TGGTATTCAT TTATACTATG CTATTTATTG 
3101 TAAAACTTCT TTTTTTTTCC AAGGAAAATA AGATAGTTTG CTTTATTTTA 
3151 AAACAGTAAC TTTCTTATAT TGGGGCACTG ACCAAAATTC AATACTGGTA 
3201 CAAATATGTT ACCTAGGGGG TCAAAATATG TGCCAGGTGA ATTTTCTGAA 
3251 TTTCTCTAAA GAGAGAATTT TAAACCTTAT AAAACAATTA GAAACAAGTG 
3301 AGTGAGAGGT GAGCATCAAC AACCTGTGTA AC AT AAGCC A CAGTACAAAT 
3351 TTAAGCTGAA TAACCAAGCC ATGTCAGTTA TCCCAAATCA TTTTTGTTAA 
3401 TATTTAGGAG GATACACATA TTTTCAATAA CTTAAAAGTG AATCTTTACT 
3451 CCTATCTCTT AATACTCGAA GAAGTATAAC TTTCTTCTTT TACTAGATTT 
3501 AAATAATCCA AATATCTACT CAAGGTAGGA TGCTGTCATT AACTATAGCT 

FIG. 1 Cont. 



3551 GAGTTTATCC AAAATAGAAA AATCATGAAG ATTTATAAAG CATTTTAAAA 
3601 ATAATCATTT ATAGCAAGTC CTTGAAAGCT CTAAATAAGA AAGGCAGTTC 
3 651 TCTACTTTCT AATAACACCT ATGGTTTATA TTACATAATA TAATTCAACA 
3701 AAACAGCATT CTGACCAATG ATAATTTATA GGAAATT CAT TTGCCAAGTA 
37 51 TATGTTTTAT TATAAAGTTA ATATTTTGAC CAATCTTAAA AATTTTTAAA 
3801 CTCTATTCTG ACATTTCCAG AAGTATTATC TTAGCAAGTC ATCTTTATGA 
3851 TACCACTTAT TAAACTGAAG AGAAACAAGA TGGTACATTC TGGGTTTTAC 
3 901 TTTAAAAGGG ATTTGATTCA ATAATTTGAT T TAT C ACT AC TTGAAAATTA 
3 951 CATTTTCTTC CTCAGACTGG ATGGCAATGA GATGAAAGCA GCTTTCCTGG 
4001 CTCTCAACTT CCCTTCTTCA TCAATTTTTC CAGCGTTTCA TAAGGCCTAC 
4051 ACTAAAAATT CTAAAACTAT /tATCACATT AATATAATTA CTTATAATTA 
4101 ATCAGCAATT TCACATTATC GTTAAAACCT TTATGGTTAA AAAATGCAAG 
4151 GTAAGAGAAG AAAAAAa/aC ATTGAACTAG AACTGAACAC ATTGGTAAAA 
4201 TTAGTGAATA CTTTTCATAA GCTTGGATAG AGGAAGAAAG AAGAC AT CAT 



4251 TTTGCCATGT AACa/gAGAC CAATGTTATT TGTGATTTCA GATTGTCTTT 
4301 GCTGGACTTC TTGGAGTCTT TCTAGCTCCT GCCCTAGCTA ACTATGTAAG 

/ 

4351 TCTCACCTTT TCAAGTTTGC TACCAAAATG CATTTGCAAG GAAATGTGAT 

4 4 01 ATTAAATCAC TCTCAATCTC TTATAAACTT CAGAATATCA ACGT CAATGA 

4451 TGACAACAAC AATGCTGGAA GTGGGCAGCA GTCAGTGAGT GTCAACAATG 

4501 AACACAATGT GGCCAATGTT GACAATAACA ACGGATGGGA CTCCTGGAAT 

4551 TCCATCTGGG ATTATGGAAA TGTAGGTAGT CAACGTGCAA TTTTCACTTT 

4 601 ATTGTTTAAA AATACGACTT CTTTTTAACA AAAAATGTGC ATGTTAACCA 

4 651 /taaagaaatt AAAAATAAAT TCTAATTACA CATAGCATAC AGTTATAAGT 
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4701 AAAGGTGACC ATTTTGCTCA TCCGATTTTG TTCCCTAGAG ATAACTACTG 
47 51 TTAATAAGTG TTGCATGATC AGTTAAAATT CAAACCAACA AACACTATGT 

4801 TCAAGGGATT GTGGGTATAT ACAACAAATA TGAACATCCT TTTGCCTTGC 

/ 

4 851 CTGCAGATAC CCTCAATAAT GCTGAAAGAC TTATACAACA TTACTGCTTC 
4 901 CAAAGCTTAG ACTATCTCAC TTTGTTTTCA AAGGAGGTTT TACGACCTTC 
4 951 TAAAGAGATT GAAATTGACA TTTCACCTAA AACTCGGGAA ATGTAAATGA 
5001 CAATATTAAT TGGTAAGAGA GGAAAGAAGA AAGAAAGAAG GAAGGAAAGA 
5051 AAGAAAGAAG GAAGGAAGGA AAGAAAGAAA GAAAGAAAGA AAGAGAGAGA 
5101 AAGAAAGAAA AAGAAAAAAG AGAGAAAGAG AGAAGGAAAG AAAGAGAGAA 
5151 GGAAAGGAAA AGAGAAGCAA AGAAAGAGAG GAGCAAAGAA AGGAACACTT 
5201 AGCACTAGTT GGGAGACCCA ACT CTGGAAT TAT CAG CT AT ATATTTAACA 
5251 AACGTTATAC TTTTAAATAG CAAACTCTTT ATTGTTTCAA TTTTATCTGG 
5301 TCAATTGGAA AAATAATTTT TGTCTTATCT GTCTCCTTGA AATGTGAGGA 
5351 TCAAAGGAGA CTAAAACATG ATAGCTTTTA AAGTCTATTT CAGTAAAACA 
54 01 GACTTATATA GAGGGGTTTT TAT CAT G CT G GAACCTGGAA ATAAAGCAAA 
54 51 CCAGTTAGAT GCTCAGTCTC TGCCCTCACA GAATTGCAGT CTGTCCCCAC 
5501 AAATGTCAGC AATAGATATG ATTGCCAAGC AGTGCCCCAT CCAGTGCTCT 
5551 TATCCCAGCT CATCACGATC TTGGAGTTCC CATTTCTCTC TGCAGGTGGA 
5601 ACTGACCTCT GATAAGAAAA GCTCCTCGGA GAACACATGC CTCACTATTT 
5651 GCCATCTACT TTAACAGGGC TTTGCTGCAA CCAGACTCTT TCAAAAGAAG 
57 01 ACATGCATTG TGCACAAAAT GAACAAGGAA GTCATGCCCT CCATTCAATC 
5751 CCTTGATGCA CTGGTCAAGG AAAAGAAGGT AAAAATAAAA GGCTTTTTAT 
5801 TTTTGGTGAG GGGAGAGGTT TTACATCCTT CAGTAAATAA CGAGAAGATC 
5851 ACAGTCATTC CCTCTTGACT ACAGTATGTT GTAGTGTGCA GCACAAAGGG 
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5901 GGAAGTTATT GGTGATTGCC TGAGGGAAGG CAACTTCTGC CACATCAAAT 
5951 GCTGTGGCTC ACACCTACCT CTACAACCGC TGAGCAAAGC ACTTGAAACC 
6001 TTGACTGTTA GAGGAGCAAA GCTCTGGTCA CACCAATAGG AGCCTCAGTA 

/ 

6051 CTTTGCCAAG GACATTTTTC TGCAAGAGTT AGTTAGGGTT ATTAGATTTA 
6101 GCAAATGAAA ATAGAAGATA TCCAGTTAGG TTTGAATTTT AGGTAAGCAG 
6151 CAGGTCTTTT TAGTATAATA TATCCTATGC AATATTTGGG ATATACTAAA 
6201 AAAAGATCCA TTGTTATCTG AAATTCAAAT GTAACTGGGT ATTGTATATT 
6251 TTGTCTGGCC ATACTAATCC AGGTGAGTGG AAAGAAGAGA TCCATAATGT 
6301 TTTAAAATAT TTGCCTGAGT TCATATTCCT ATAACTGATA AATGAGTACC 
6351 TTTCATTGAC AAGGTAGAGA AAATAAATAA ACTGCATTCT CAGAAGATGA 
6401 TTATTACATA GTCTAATCCA AGGAATCTAT GAT GAC C AAA TGAGGTCCAA 
6451 GTTGCAGAAT AAATTAAGCC TCAGACTTCT GTGTTTATGA GAAGCTGAGG 
6501 TTTCAAACCA GGTAAATCCC TTAGGACACT TAGAAATGCT AAGATATACA 
6551 GAATAAGCTA GAAATGGCTC TTCTTCATCT TGATTATGGA AAAATTTAGC 
6601 TGAGCAACAC TCACTGTTGG CCTCGTATAC CCCTCAAGTC AACAAACCAC 
6651 TGGGCTTGGC ATTCATTCTC TCCCATTCTT CCTTTCTACC TCTCTTTTCC 
67 01 ACACTCAGCT TCAGGGTAAG GGACCAGGAG GACCACCTCC CAAGGGCCTG 

67 51 ATGTACTCAG TCAACCCAAA CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 
6801 AAACATTGCA AACATGTGTC GTGGGATTCC AACATACATG GCTGAGGAGA 

68 51 TGCAAGGTGA GTAGCATCCC TACTGTGCAC CCCAAGTTAG TGCTGGTGGG 
6901 ATTGTCAGAC TATCCTCGCG CGTGTCCATA GTGGGCACCA GTGATGCAGG 
6951 GATGGTCATC AAGGCCAACA TTTGTGCAGT GCTTGCTCTG TGCCAGGTAC 
7001 TGTTCTATGT GCTTTAAGTG TGTTAACTCG GTTCTTCACA GCAATCTTAT 
7 051 AGGTTCTATT TTAATCCTAC TTTATGGATG AGGAAACTGA GGTACAGAGA 
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7101 GGTCACAAAA TCCTTGCCTG GGTCAATTCC AAGCATTTTG GCTGTGGATT 
7151 CTGTGCTCTT AAATATTATG GAACACTGCC TTTTAAGTGT GAATCAAGAG 
7201 TAGACTCAAG TCATATTCAA AAGAATGCAT GAATGGCTAA AT GAAAG AAG 
7 251 AATGCTAATA GAATCTATTA ACTTTCTATA G CT C AG AC AA TCACTTAATT 
7301 TCTGGACATT CAAAGAACAG CTGCACACAA ACAAAGTGTC TACCTAGGGA 
7351 CCTAACTTAA TGGCAATTTT CCAGATCTCT GAATTGATTG ATTT CAT CAC 
7 4 01 AACAAGTAGA TAAACCTTGA CATTAGCACA TAGCTAGTTT GGAAACCCCT 
7 451 ACTCCCCCAA TCCCCTCCAA GAAAAGAGTC CTTAAATAGA CATTAATATA 
7 501 GGCTTCTTCT TTTCTCTTTA TTAGAGGCAA GCCTGTTTTT TTACTCAGGA 
7 551 ACGTGCTACA CGACCAGTGT ACTATGGATT GTGGACATTT CCTTCTGTGG 
7 601 AG AC AC GG T G GAGAACTAAA CAATTTTTTA AAGCCACTAT GGATTTAGTC 
7 651 ATCTGAATAT GCTGTGCAGA AAAAATATGG GCTCCAGTGG TTTTTACCAT 
7 7 01 GTCATTCTGA AATTTTTCTC TACTAGTTAT GTTTGATTTC TTTAAGTTTC 
7751 AATAAAATCA TTTAGCATTG AATTCAGTGT ATACTCACAT TTCTTACAAT 
7 801 TTCTTATGAC TTGGAATGCA CAGGATCAAA AATGCAATGT GGTGGTGGCA 
7851 AGTTGTTGAA GTGCATTAGA CTCAACTGCT AGCCTATATT CAAGACCTGT 
7 901 CTCCTGTAAA GAACCCCTTC AGGTGCTTCA GACAC CACT A ACCACAACCC 
7 951 TGGGAATGGT TCCAATACTC TCCTACTCCT CTGTCCACTG CTTAA 
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1 CATGCTTGCC TACTCCTCTG TCCACTGCTT TCGTGAAGAC AAGATGAAGT 
51 TCACAATTGT CTTTGCTGGA CTTCTTGGAG TCTTTCTAGC TCCTGCCCTA 
101 GCTAACTATA ATATCAACGT CAATGATGAC AACAACAATG CTGGAAGTGG 
151 GCAGCAGTCA GTGAGTGTCA ACAATGAACA CAATGTGGCC AATGTTGACA 
201 ATAACAACGG ATGGGACTCC TGGAATTCCA TCTGGGATTA TGGAAATGGC 
251 TTTGCTGCAA CCAGACTCTT TCAAAAGAAG ACATGCATTG TGCACAAAAT 
301 GAACAAGGAA GTCATGCCCT CCATTCAATC CCTTGATGCA CTGGTCAAGG 
351 AAAAGAAGCT TCAGGGTAAG GGACCAGGAG GACCACCTCC CAAGGGCCTG 
4 01 ATGTACTCAG TCAACCCAAA CAAAGTCGAT GACCTGAGCA AGTTCGGAAA 
4 51 AAACATT GCA AACATGTGTC GTGGGATTCC AACATACATG GCTGAGGAGA 
501 TGCAAGAGGC AAGCCTGTTT TTTTACTCAG GAACGTGCTA CACGACCAGT 
551 GTACTATGGA TTGTGGACAT TTCCTTCTGT GGAGACACGG TGGAGAACTA 
601 AACAATTTTT TAAAGCCACT ATGGATTTAG T CAT CTGAAT ATGCTGTGCA 
651 GAAAAAATAT GGGCTCCAGT GGTTTTTACC ATGTCATTCT GAAATTTTTC 
7 01 TCTACTAGTT ATGTTTGATT TCTTTAAGTT TCAATAAAAT CATTTAGCAT 
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1 mkFTIVFAGLLGVFLAPALANYNIDVNDDNNNAGSGQQSVSVNNEHNVAN 50 

51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

.01 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGIPTYMA 150 

.51 EEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 185 
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1 G AATT C AAAC AGCAGGCCAT CTTTCACCAG CACTATCCGA ATCTAGCCAT 
51 ACCAGCATTC TAGAAGAGAT GCAGGCAGTG AGCTAAGCAT CAGACCCCTG 
101 CAGCCCTGTA AGCTCCAGAC CAT GG AG AAG AGGAAGGTTG TGGGTTCAAG 
151 GAGCTTTTCA GAGTGGAAAT CTGTGGATCA GTGATTTATA AAACACAGTT 
201 TCCCCCTTTA TTAGATTTGA ACCAGCAGCT TCAGTTGTAG AAGAGAACAG 
251 GTTAAAAAAT AATAAGTGTC AGTCAGTTCT CCT T C AAAAC TATTTTAAAC 
301 GTTTACTTAT TTTGCCAAGT GACAGTCTCT GCTTCCTCTC CTAGGAGAAG 
351 TCTTCCCTTA TTTTAATATA ATATTTGAAA GTTTTCATTA TCTAGAGCAG 
40 1 TGGTTCTCAT CCTGTGGGCC ATGAGCCCTT TGGGGGGGTT GAACGACCCT 
45i TTCACAGGGG TCACATATCA GATATCCTGC ATCTTAGCTA TTTACATTAT 
01 GATTCATAAC AGTAGCAAAA TTAGTTAGGA AGTAGGAACA AAATAACGTT 
551 ATGGTTGTGG TCACCACTAT GTTAGAGGGT CCGCAGCATT CAGAGGGTTG 
601 AGAACTGTTG TTCTAGAGGC AAATAAGAAG ACAGAGTTCC TTGATAGGGC 
65 1 CCAGAGGCAG TGAAAGAAGT TTCCACGTAG AAAGT G AAG A AGGTCTGGTG 
701 TCCGAAGCAG TGAGGAACTT AAAAAAAGAA AACCAAAAAC ATTGCCAACT 
75 1 AACAGT C C AG GAGAAGAGCG GGGCATGAAA GGCTGAGTTC CCATGGGATG 
SOI CCTTGAATGG AATCAGAGTG TGGGAAAATT GGTGTGGCTG GAAGGCAGGT 
851 GCCGGGCATC TCAGACGCTG GTAGCTGGGG AAACAGGAAA CCCCTTTAGG 
90 1 AT CCCAAGAT GCCATTCCAA TGAGCTTGAG ATTTTTCTCA TGGACTGCCA 
95 1 GTGAATGTTT CTACGCTCCG GAAATTAATG TTTACTTATT TTCCATATTC 
1001 TAGGGGAGAA CCCTGGGAAA AATGGAGGAC ATTCATTGAA ATATCTGAGT 
1051 CCTGGGATAA GGCAGGCTTG GTCCTACAAC TCTGGTAAAA GTCCATCAGG 
1101 AAGTGCCTTG ACCAAGGCTG GAGTGGAGAG CTGTTGGTGA GATGTAAGGG 
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1151 CAAGGTTTAG TTGCTAGATA TGTAGATGGC AAGATGGTGC TGCCAACAGC 
12 01 CCCCAGAGCT CTAACCCACT GAGAAACCCA GGAATGAATG ATGGGAGATG 
1251 GCTTTGGTGC CAGCTGCTAG TGACATGGCT GGAAAGCTGC ACTGGCTTCG 
1301 AGGCCAGACA ATTCCTCAAG GAAACATCTG GCCAGGGTGC AAGGGCCAGT 
1351 TTCCTTCCTT GGAGTTCCTT TCACAGCTAA GAACATCATC CCCCAACCAC 
1401 TGGTTTTGTT AAAAAGTTTT CAGTATGACT TGAGCATGGT CAAGAAGCAT 
14 51 AGAGAGGGGG AAATAAGGGT GGAAGGAGCT GGAGAAAGCT TACAATAGGA 
1501 CTGGGTAAAG GGAAGGAGAA GAAACCATTC CCGCATTCCC ATAGGAGCCA 
1551 GTACCAGGAA GGGCAGGTGT ACACACAGAT CTCATCTAAG GCCATGTTTG 
1601 GTTTAGGGAT TACTCTTCTC CCGAATCTGA G C AG C AG C AA TACGTAAAAT 
1651 ACCCACACCC ATGGCTTCCA TATTCCAGAA CTTATCACAA ACCGTGTAGA 

17 01 GTTTACTGAG ATACCTTCGT CAGAGGATGA GTCAGAGGCC TCCTGCCTAA 
1751 GGGCCCTACT GAGCAGGCAG CTAAAGGCTT CCGGGCCTCT GCAGCTCCAC 
1801 AGATACAGGA GAGGGAAGCA GATAAGCCGT GGACTCCACC TGAGCACACC 

18 51 TAGCTTGAGC AAAGCTGGTC AGGTACAAAT AGCAGAGGGC TGAATGTCTG 
1901 TGAGCACGCC GCCTGATCCT CTGCTCCACC ACACTCCTGC CGCCATGAAG 
1951 CTCACAGTAA GTCAGATCTT CTTTTCAATG CAGCACCATA CAACATTAAT 
2001 AGTCAGGGGT GAGGGGGTCT GACTCTTACG GCACTGTTAC CATAGTGGAA 
2051 ATATTCTCCT TTCTTTTCAT GGAATCATGG TGTTTACAAG CATGTCCATA 
2101 GAGAAGAAGA ATTGCCCCGG AAGAGCCTGT CACAGGCTGA ATACTGTAGA 
2151 ATTGTCTTTC ACACCATCTG TTCCAAGGTT CTACTTAAGA CGAGCAGTCT 
2201 GTGGGCTCCA GAAAGAGTCT TTCTTAGCCT TGATCTCTTT CTTATTTCTG 
2251 ATTTCTCCTT TCTTATCCAT GATTTCCACT TTTACCAGTT CTGGGCATGT 
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2301 TCCGGTCAGA CTGGAAGATC ACTGTTGTCA AAACTAGTCT TCAACACTCT 
2351 TGGCTGTTAA CATGAAAACA ACGGTCCTTG GGCCCTGTGC AAGCATTTCT 
24 01 TGGAGAAAGT CTCTGGGGAT GAAGCTATCT CAGTTTCCCC ACTGAAGTCC 
2451 TAGGATACAG AGGCTCAAAC AGAGTGCACA TATTCAATTT CAGCATACTC 
2501 TATTGGCGCT GCTTTATGAA TCATATGAAT TTATGGAATT GGAAATGTAA 
2551 ACTATGACCA AGAAGCGTCC ACCTCAGAAC AGGTTGGGTG GGGAACTCCA 
2 601 AGCACAGGCC AGAGGGCTGC GTTTCTCTTC TAGTTCTGTC TAGAGGAGTG 
2 651 GTTCTCGACC TTCCTAATGC TGTGACCCTT TAATACAGTT CCTCACGTTG 
27 01 TCGTGACTCC CAGCCATAAA ATTACTTTCA TTGCTACTGC ATAACTGTAA 
27 51 TTTTGCTACC ATTATGAGTT GTAATGTAAA TATCTGATAT GCAAGATACC 
2801 AGATAACCTA AGAAACGGTT GTTTGACCTT TAAAGGGGTC ACAACCCACA 
2851 GGTGGAGAAC TACTGGTCTA GGGTCCTTTA CAGTCCTTTA GCTGCCTCAT 
2901 TTACAGGAGA TAACATCATG CTCAAAAACT CCCTCCACAT TTGGCTTTTT 
2951 GGGTTGTTTT GTTTTGTTTT TCAAGACAGG GTTTCTCTGT GTAGCCCTGG 
3001 CTGTCCTGGA ACTCACCTTT GTAGACCAGG CTGGCCTCGA ACTCAGAAAT 
3051 CCGCCTGCTT CTGCCTCCTG AGCGCTGGGA TTAAAGGCGT GCGCCACCAT 
3101 GTCTGGCTCA CATCTGGCTT TTTAAGAGAC CGATTTTAAC TTCTTGCATT 
3151 GAAAATAAAT ATAGTAGAAA TGCTTAACCT ACTAAGACAA TAAAAACAGG 
3201 ATTCCTTCTG CTAGGAAGAA CACGTTCCAG ACTAAGGAAA AAAACCTTTT 
32 51 CAGGGCTTTC ATTACACTGT GCCATGCACT AATTTTATGT TTTCTTCATC 
3 '301 AGTTTTCAGT GTCTGAAATT CAGTGTCAAA ATTCTAAGAC TACATATGAA 
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3351 TATCATTACA GTAACTCAGC AATTCTATGT TACCAGTAAG TTTTTCTGTA 
34 01 GTTTAAAAAA AAGGTGGAAG AAGAAAGCAC AGATAGTTTA GCACATGGGT 
34 51 AAAATCAGTA ACTATTTCTG ATGAGCTTGG TGAAGATGCT GTAAACCATG 
3501 CGACCACCAG TCCTGTTCTC TGTGCTTTCA GATGTTCGTC GTGGGTCTGC 
3551 TTGGCCTCCT TGCAGCTCCT GGTTTTGCTT ACGTAAGTCT CATTTTTCTG 
3601 AAGTTCATTG TCAAAACTGC ATTTACAGTG AAATGTGATC TTAAGTCACC 
3651 CTCTGCTTCT TATGAACATT AG ACGGT CAA CATCAATGGT AATGATGGCA 
37 01 ATGTAGACGG AAGTGGACAG CATTCGGTGA GCATCAATGG TGTGCACAAC 
37 51 GTGGCCAATA TCGACAACAA TAACGGCTGG GACTCCTGGA ATAGCCTCTG 
3801 GGACTATGAA AACGTATGTA ATGGACACAC AGGGTAAAGA TATGGTGTAG 
3851 CCACCACCCA TTAAAATTTC TGAGGTGAAT TCTAGCTGTT CAT G AAC AT T 

3 901 AAAAGCTACC AGTAAAAGTG CCCATTCCAC TCAAAACAAT TTTACTTTTT 
3951 TGCATATAAT TATTGCTAAT AAGTATTACA CAATAGGTCG AAATTCAAAG 

4 001 GGATCAATAG TAAGGATAAA AACTATGTAC AAAGACAAAC ACAGCATCCT 
4051 TTGGTCTTCC CTGCAGAGAG TCTCCATGAT GTTAAAGGTC CAATGTTTTA 
4101 TGGAGGCTGA ATGAAATACG AATGCCTCTG TGATGGAAAA GGCCCAACAT 
4151 CTTATGGAGA ATGAGTGAAG TATGAATGCT ATTAGTTGTA AGAGAAGGCG 
4201 ATGCAAAGCA ACACTTGGCA CCACCTGCCA ATTACTACTT TCCTATTTAA 
4251 ATGTAGTTTA AAAAGCAAAG CCTGTCTTCC CTGCCTCCTG GAAACACTGC 
4301 GGATGGAGGT AGACCAAGGT ATGACAGCCT TTAAAAGTTT GTCAGCAAAA 
4 351 CACTCCCCCA TACACACATA CACACACCCT CCTACTACAC TGGAACTGAA 



FIG. 4 Cont. 



4 401 GCAAAGGCAG TGGGTTAGAT ATATCCACCC TCTAAGAGTT TGCAGGTCAT 
4 451 CTATATATGA TAGCCAGAGA CACAACTGCA GGACAGCCAG ACTCTGAGCA 
4 501 CTCTCCCCAG CTCCTTGTAG CTCTGTTTCA GTGGTGACTT GTGACAAGAA 
4551 TCCTGGGGAA CCTGTGCCTC ACTGTTCTCT GTCTTCTTTA ATAGAGTTTC 
4 601 GCTGCCACGA GACTCTTCTC C AAGAAGT CA TGCATTGTGC ACAGAATGAA 
4 651 CAAGGATGCC ATGCCCTCCC TTCAGGACCT CGATACAATG GTCAAGGAAC 
47 01 AGAAGGTAAA GTCCTGCCTT CTTCTTTGGA GTGACAGGAA GTCTTACAGT 

47 51 CTCCAGTACA CAGTGAAGTC ACCCCCATTC CCTCTTTGGT GGAGCATGAC 

48 01 AGCATGTTTG T CAT GAT AAA TGCCACAAAC ATGTAAAACT GTTCAGTGTC 
4851 TGCCTGAATG GAGGGT GGCT TCCACTGTGT CAGATGCCGT GGCCCACATC 
4 901 TGCCTCTGCA GGGTCCAGTA AAGCACTGGC TATCTTGAGT GTCAGAGACC 
4 951 CAAAGGTCTG TACACTTCAG TACAAGCCCT CCATATTTCA AGGGCACACT 
5001 CCTACAGTCG TTGGGGTTAT CAGAACTAGC AAACATAGAG ACTGGATTTT 
5051 CAGATGAAAA GAAAT CCTTT TTAAAGTCTA AGTATGCCTT ATACAATGTT 
5101 TGAGATATTC TCAATACTAA AAAAAAAAAA ATTGTTGCTT GCTTGAAAAT 
5151 CAAATGTAAC CAAGT GTCCT ATATCCAGTG TCAATCATGG CTGTAGTAGA 
5201 TGGGAAGAGG GAGCCCGTGG TTTTCACAGT CAGACGCCTG AGTTATTCTT 
52 51 CTAAGTGATA AATTGGTTCC TATAACAAGC AAGCCAGTGA ATATAAATAA 
5301 GCTCTATCTC AGAAGTTATC CTGTAGTGCT ACCCTAGAAT CTAAGAGAGC 
5351 AAAAGTGCTT CAAATTTCAG AATAAGTTTT GCTTTGGACT TCTGTTTTTC 
54 01 TAAACAACTA TAACTTCAAA CCATCTAAGC CTCGTGGGAC ACTTAGAAAT 
54 51 ACCAAGCCAT TCAAAGCTAG AATTGTTTCT TCACCTTACT TGAAAACAAA 
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5501 ATGACAACCA AAAATTGTCC CCACTGCCCT TGTACATCTT CAGATCAGTA 

5551 AAGTCCTGGG CTCAGGGATC ATTCACTTTC TTTCTTTCCT TTCACACTCA 

5601 ACTTCAGGGT AAAGGGCCTG GAGGAGCTCC TCCCAAGGAC TTGATGTACT/ 

5 651 CCGTCAACCC TACCAGAGTG GAGGACCTGA ATACATTCGG ACCAAAGATT 

57 01 GCTGGCATGT GCAGGGGCAT CCCTACCTAT GTGGCCGAGG AGATTCCAGG 

57 51 TGTGTACCCT GAGATGCTGT ATATCCCAAT GCAGTACTGA GAGAGCCATC 

58 01 AGAC ACT CT A AAGTGTGACC ACAGACGGAC CAATCATGTG GATTATCAGA 
58 51 GCAAACACTT GCTTGCTCCT TGTCAGACAG TTGTCCATGC TTCAAAAGTT 

5 901 CATTAAAAAA AATAGTTCAC AGGCTCCTCA CAGAAACCTT AGTAGAATCC 

/ 

/ 

5951 ACAGCTTCTG CTCTTAGTCT TACTTT^TTAG AAAC T GAG AC CCAGAGAAAG 
60 01 GTCACAAAAC TTTTGTCTGG CTCAGGTTCT ATGTCTTTAA CTTTATAGAA 

6051 TACCGTCTTT CTGGGTGGGT GGGCTCTAGA GTAAACTTCA AGTGAGTTCA 

/ 

6101 AGGAAAGCAT GAGAAGTAG<3 G AAG AC C AAA TGAAAGGAGA ATGCCAATGA 

6151 AATCTATCGA TTCTATAGCG CCAATGCTTA ACTCCTAGGC GTTCAAAGAA 

/ 

62 01 TAGTATCCAC AAGGTGTCAG CCTAAGATCC TAATCTAACA GCAAGTTTTC 

/ 

62 51 AGATCTCTGA AG T G AAAAG A GAAAGCAAGA GAGGAACAGA GACAGAAACA 
6301 GTAAGAGACA GAGAGGCAGA GACAAAGAGA CAGGGAGAAT AGAGAGGGAT 
6351 TAAAATTAAT ATATAGTTTA GAAATTACGA CTCCTCACAG TCCCTGCAGA 



64 01 GTCCTAGGAT AGGCACTGAT TTGGACTTCT TTTCTTCTCA CTAGGACCAA 

64 51 ACCAGCCTTT GTACTCAAAG AAGTGCTACA CAGCTGACAT ACTCTGGATT 

6501 CTGCGGATGT CCTTCTGTGG AACATCAGTG GAGACATACT AGAAGTCACA 

6551 GGAAAACAAC CCGTGGGCTC TGACCATCGC AATGCTTGAT TATGAGAGTG 
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6601 TTCTCTGGGG GTTGTGATTA GCTTCTTTAA GGCTCAATAA ACCCACGTGG 

6651 CAGCACATCC AGTTTGTAAT GACATGCCTC ATGACTTCTA TGGGAGTCCA 

67 01 ATGTGGCACC TGCCAGCCTG TATTCAGGAC CTCTCCGCTA TAAAGCATCC 

67 51 CTCCAGAGTT TTCAAATACT ACAAAGCACA GCCTGGGTTT GGGCTCAGAT 
6801 AGGCCACTGC TGCCTGACTA CAT T AC AG AC AAACAAGTTT TAAAAGAAAG 

68 51 AAAAAAGAGC TCAGAGTGGC TGGAATCAGC AAGGGTGTTT TTCCTGCAAG 

/ 

6901 GAGCCAGAAG TATCAATAAT CACCCAAGGA GGAGACACTG GGAAT GAG AG 

6951 ACTAGAACAC ACGCCTGCAG ATACGGAGAA CCTCAGCATT GCCGCTCTCT 

7 001 CCCATAACTG CACA^CCCCT TCTGTAAACT CTGCTTCTTT CTTTCACCTG 



7 051 AAGATGGCCC TTGCTTTTTT TTATTATAGG ACANGATAAC TAGACCAGAA 

/ 

7101 AGTCAACCTG 7 ACTCTCTACA TTTATATGTC TTCCCAGNTC AAGAAATATT 

/ 

7151 ATTTACTGGT GAATGGCACT TCTATATTCC CTTGGTTCAA TAAGTCTACA 



7201 GGATCCATTC ATTGACAGGC CAAGAGTGAG ATCACATGAT ACCCAAGCAC 



7 2 51 ATGGGTCTTT CCTTGAAGGA GAAGGATCCA 
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1 ATGTTCGTCGTGGGTCTGCTTGGCCTCCTTGCAGCTCCTGGTTTTGCTTACACGGTCAAC 
61 ATCAATGGTAATGATGGCAATGTAGACGGAAGTGGACAGCATTCGGTGAGCATCAATGGT 
121 GTGCACAACGTGGCCAATATCGACAACAATAAC^GCTGGGACTCCTGGAATAGCCTCTGG 

181 GACTATGAAAACAGTTTCGCTGCCACGAGACTCTTCTCCAAGAAGTCATGCATTGTGCAC 

/ 

2 41 AGAATGAACAAGGATGCCATGCCCTCCCTTCAGGACCTCGATACAATGGTCAAGGAACAG 



3 01 AAGGGTAAAGGGCCTGGAGGAGCTCCTCCCAAGGACTTGATGTACTCCGTCAACCCTACC 

3 61 AGAGTGGAGGACCTGAATACATTCGGACCAAAGATTGCTGGCATGTGCAGGGGCATCCCT 

4 41 ACCTATGTGGCCGAGGAGATTCCAGGACCAAACCAGCCTTTGTACTCAAAGAAGTGCTAC 

501 ACAGCTGACATACTCTGGATTCTGCGGATGTCCTTTTGTGGAACATCAGTGGAGACATAC 

/ 

561 TAG 
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1 MKLTMFVVGL LGLLAAPGFA YTVNINGNDG NVDGSGQQSV SINGVHNVAN 

51 IDNNNGWDSW NSLWDYENSF AATRLFSKKS CIVHRMNKDA MPSLQDLDTM 

/ 

101 VKEQKGKGPG GAPPKDLMYS VNPTRVEDLN TFGPKIAGMC RGIPTYVAEE 

/ 

/ 

151 IPGPNQPLYS KKCYTADILW ILRMSFCGTS VETY 
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1 atgcctgact tctcacttca ttgcattggt gaagccaaga tgaagttcac 
51 aattgccttt gctggacttc ttggtgtctt cctgactcct gcccttgctg 
101 actatagtat cagtgtcaac gacgacggca acagtggtgg aagtgggcag 
151 cagtcagtga gtgtcaacaa tgaacacaac gtggccaacg ttgacaataa 
201 caatggatgg aactcctgga atgccctctg ggactataga actggctttg 
251 ctgtaaccag actcttcgag aagaagtcat gcattgtgca caaaatgaag 
301 aaggaagcca tgccctccct t4agccctt gatgcgctgg tcaaggaaaa 
351 gaagcttcag ggtaaggget cagggggacc acctcccaag agcctgaggt 
4 01 actcagtcaa ccccaa'caga gtcgacaacc tggacaagtt tggaaaatcc 
451 atcgttgcca tgtgcaaggg gattccaaca tacatggctg aagagattca 
501 aggagcaaac ctgatttcgt actcagaaaa gtgcatcagt gccaatatac 
551 tctggattct taacatttcc ttctgtggag gaatagcgga gaactaa 
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/ 

MKFT I AFAGL LGVFLTPALA DYSISVNDDG NSGGSGQQSV SVNNEHNVAN 

/ 

VDNNNGWNSW NALWDYRTGF AVTRLFEKKS C I VHKMKKEA MPSLQALDAL 



VKEKKLQGKG PGGPPPKSLR YSVNPNRVDN LDKFGKSIVA MCKGI PTYMA 



/ 

EEIQGANLIS YSEKCISANI LWILNISFCG GIAEN 
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1 MKFTIAFAGL LGVFLTPALA DYSISVNDDG NSGGSGQQSV SVNNEHNVAN 

51 VDNNNGWNSW NALWDYRTGF AVTRLFEKKS CIVHKMKKEA MPSLQALDAL 

101 VKEKKLQGKG PGGPPPKSLR YSVNPNRVDN LDKFGKS IVA MCKGIPTYMA 

151 EEIQGANLIS YSEKCISANI LWILNISFCG GIAEN 
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+ 



Human 1 MKFTIVFAGLLGVFLAPALANYNIDVNDDNNNAGSGQQSVSVNNEHNVAN 50 

Pig 1 MKFT I AFAGLLGVFLT PALAD YS I SVN DDGNSGGSGQQSVSVNNEHNVAN 50 

51 VDNNNGWDSWNSIWDYGNGFAATRLFQKKTCIVHKMKKEVMPSIQSLDAL 100 

51 VDNNNGWNSWNALWSYRTGFAVTRLFRKKSCIVHKMKKEAMPSLQALDAL 100 

101 VKEKKLQGKGPGGPPPKGLMYSVNPNKVDDLSKFGKNIANMCRGIPTYMA 150 

101 VKEKKLQGKGPGGPPPKSLRYSVNPNRVDNLDKFGKSIVAMCKGIPTYMA 150 

/ 

/ 

151 EEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 185 
151 EEIQGANLISYSEKCISANILWILNISFCGGIAEN 185 
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Human 

Pig 

Mouse 



1 50 

MKFTIVF . AG LLGVFLAPAL ANYNIDVN.D DNNNAGSGQQ SVSVNNEHNV 

MKFTIAF . AG LLGVFLTPAL ADYSISVN.D DGNSGGSGQQ SVSVNNEHNV 

MKLTM.FVVG LLGLLAAPGF A . YTVNINGN DGNVDGSGQQ SVSINGVHNV 



Human 

Pig 

Mouse 



51 100 

ANVDNNNGWD SWNSIWDYGN GFAATRLFQK KTCIVHKMNK EVMPSIQSLD 

ANVDNNNGWN SWNALWDYRT GFAVTRLFEK KSCIVHKMKK EAMPSLQALD 

ANI DNNNGWD SWNSLWDYEN SFAATRLFSK KSCIVHRMNK DAMPSLQDLD 



Human 

Pig 

Mouse 



101 / 150 

ALVKEKKLQG KGPGGPPPKG LMYSVNPNKV DDLSKFGKNI ANMCRGI PTY 
ALVKEKKLQG KGPGGPPPKS LRYSVN PNRV DNLDKFGKSI VAMCKGIPTY 
TMVKEQK. . G KGPGGAPPKD LMYSVNPTRV EDLNTFGPKI AGMCRGIPTY 



Human 

Pig 

Mouse 



151 188 

MAEEMQEASL FFYSGTCYTT SVLWIVDISF CGDTVEN 

MAEEIQGANL ISYSEKCISA NILWILNISF CGGIAEN 

VAEEIPGPNQ PLYSKKCYTA DILWILRMSF CGTSVETY 
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